The following datasets will be used in the course:


titanic

Real data on 963 passengers on the Titanic.


airsat

Real data on 10,000 customers of an airline.


iris

Real data on 150 iris flowers.


globalWarm

Real data on emotions, ideology, and party affiliation as predictors of attitudes towards government action on climate change.


heart

Real data on risk for heart disease.


ahd

Real data on Alzheimer’s Disease.


hcp_memory

Real neuroimaging data from the Human Connectome Project used to predict scores on a memory test. Note: the data were artificially modified to increase predictive power and make activities more engaging.


water

Real data on 3,276 different water bodies. Modified so that Potability is a character variable rather than a numeric (dummy-coded) variable.


attrition

Simulated data on 1,470 fictional employees who either quit their jobs (attrition = yes) or did not (attrition = no).